Logical Methods in Computer Science 
Vol. 11(3:13)2015, pp. 1-22 
www.lmcs-online.org 


Submitted Nov. 14, 2014 
Published Sep. 17, 2015 


LEARNING REGULAR LANGUAGES 
OVER LARGE ORDERED ALPHABETS 

IRINI-ELEFTHERIA MENS AND ODED MALER 

VERIMAG, CNRS and University of Grenoble, France 
e-mail address: {irini-eleftheria.mens,oded.maler}@imag.fr 


Abstract. This work is concerned with regular languages defined over large alphabets, 
either infinite or just too large to be expressed enumeratively. We define a generic model 
where transitions are labeled by elements of a finite partition of the alphabet. We then 
extend Angluin’s L* algorithm for learning regular languages from examples for such au¬ 
tomata. We have implemented this algorithm and we demonstrate its behavior where 
the alphabet is a subset of the natural or real numbers. We sketch the extension of the 
algorithm to a class of languages over partially ordered alphabets. 


Introduction 

The main contribution of this paper is a generic algorithm for learning regular languages 
defined over a large alphabet S. Such an alphabet can be infinite, like N or M or just so 
large, like B"' for very large n or large subsets of N, so that it is impossible or impractical 
to treat it in an enumerative way, that is, to write down the entries of the transition 
function S{q,a) for every a € S. The obvious solution is to use a symbolic representation 
where transitions are labeled by predicates which are applicable to the alphabet in question. 
Learning algorithms infer an automaton from a finite set of words (the sample) for which 
membership is known. Over small alphabets, the sample should include the set S of all 
the shortest words that lead to each state (access sequences) and, in addition, the set S' ■ S 
of all their S-continuations. Over large alphabets this is not a practical option and as an 
alternative we develop a symbolic learning algorithm over symbolic words which are only 
partially backed up by the sample. In a sense, our algorithm is a combination of automaton 
learning and learning of non-temporal predicates. Before getting technical, let us discuss 
briefly some motivation. 

Finite automata are among the corner stones of Computer Science. From a practical 
point of view they are used routinely in various domains ranging from syntactic analysis, 
design of user interfaces or administrative procedures to implementation of digital hard¬ 
ware and verification of software and hardware protocols. Regular languages admit a very 
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nice, clean and comprehensive theory where different formalisms such as automata, logic, 
regular expressions, semigroups and grammars are shown to be equivalent. The problem 
of learning automata from examples was introduced already in 1956 by Moore [Moo56] . 
This problem, like the problem of automaton minimization, is closely related to the Nerode 
right-congruence relation over words associated with every language or sequential function 
[Ner58| . This relation declares two input histories as equivalent if they lead to the same 
future continuations, thus providing a crisp characterization of what a state in a dynam¬ 
ical system is in terms of observable input-output behavior. All algorithms for learning 
automata from examples, starting with the seminal work of Gold |Gol72] and culminating 
in the well- known L* algorithm of Angluin |Ang87| are based on this concept [DIHIO] . 

One weakness, however, of the classical theory of regular languages is that it is rather 
“thin” and “flat”. In other words, the alphabet is often considered as a small set devoid of any 
additional structure. On such alphabets, classical automata are good for expressing and 
exploring the temporal (sequential, monoidal) dimension embodied by the concatenation 
operations, but less good in expressing “horizontal” relationships. To make this statement 
more concrete, consider the verification of a system consisting of n automata running in 
parallel, making independent as well as synchronized transitions. To express the set of joint 
behaviors of this product of automata as a formal language, classical theory will force you 
to use the exponential alphabet of global states and indeed, a large part of verification is 
concerned with fighting this explosion using constructs such as BDDs and other logical forms 
that exploit the sparse interaction among components. This is done, however, without a 
real interaction with classical formal language theory (one exception is the theory of traces 
|DR.95] which attempts to treat this issue but in a very restricted context). 

These and other considerations led us to use symbolic automata as a generic framework 
for recognizing languages over large alphabets where transitions outgoing from a state are 
labeled, semantically speaking, by subsets of the alphabet. These subsets are expressed 
syntactically according to the specific alphabet used: Boolean formulae when S = B”’ or 
by some classes of inequalities when E C N or S C M. Determinism and completeness of 
the transition relation, which are crucial for learning and minimization, can be enforced 
by requiring that the subsets of E that label the transitions outgoing from a given state 
form a partition of the alphabet. Such symbolic automata have been used in the past for 
Boolean vectors |H,T.T+95| and have been studied extensively in recent years as acceptors 
and tran sducers where transitions are guarded by predicates of various theories [Hvm 
IVHL+12] . 

Readers working on program verification or hybrid automata are, of course, aware of 
automata with symbolic transition guards but it should be noted that in the model that 
we use, no auxiliary variables are added to the automaton. Let us stress this point by 
looking at a popular extension of automata to infinite alphabets, initiated in |KF94| using 
register automata to accept data languages (see [BLPIO] for a good exposition of theoretical 
properties and |HSJG1^ for learning algorithms). In that framework, the automaton is 
augmented with additional registers that can store some input letters. The registers can 
then be compared with newly-read letters and influence transitions. With register automata 
one can express, for example, the requirement that the password at login is the same as the 
password at sign-up. This very restricted use of memory makes register automata much 
simpler than more notorious automata with variables whose emptiness problem is typically 
undecidable. The downside is that beyond equality they do not really exploit the potential 
richness of the alphabets and their corresponding theories. 
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Our approach is different: we do allow the values of the input symbols to influence 
transitions via predicates, possibly of a restricted complexity. These predicates involve 
domain constants and they partition the alphabet into finitely many classes. For example, 
over the integers a state may have transitions labeled by conditions of the form ci < x < C 2 
which give real (but of limited resolution) access to the input domain. On the other hand, 
we insist on a finite (and small) memory so that the exact value of x cannot be registered 
and has no future influence beyond the transition it has triggered. Many control systems, 
artificial (sequential machines working on quantized numerical inputs) as well as natural 
(central nervous system, the cell), are believed to operate in this manner. The automata that 
we use, like the symbolic automata and transducers studied in |HV11[|VHL'*~12[IVB12| . are 
geared toward languages recognized by automata having a large alphabet and a relatively- 
small state space. 

We then develop a symbolic version of Angluin’s L* algorithm for learning regular 
sets from queries and counter-examples whose output is a symbolic automaton. The main 
difference relative to the concrete algorithm is that in the latter, every transition S{q,a) 
in a conjectured automaton has at least one word in the sample that exercises it. In the 
symbolic case, a transition 6{q,a) where a stands for a set of concrete symbols, will be 
backed up in the sample only by a subset of a. Thus, unlike concrete algorithms where a 
counter-example always leads to a discovery of one or more new states, in our algorithm it 
may sometimes only modify the boundaries between partition blocks without creating new 
states. There are some similarities between our work and another recent adaptation of the 
L* algorithm to symbolic automata, the S* algorithm of [BB13j . This work is incomparable 
to ours as they use a richer model of transducers and more general predicates on inputs and 
outputs. Consequently their termination result is weaker and is relative to the termination 
of the counter-example guided abstraction refinement procedure. 

The rest of the paper is organized as follows. In Section [T] we provide a quick summary 
of learning algorithms over small alphabets. In Section [2] we define symbolic automata and 
then extend the structure which underlies all automaton learning algorithms, namely the 
observation table, to be symbolic, where symbolic letters represent sets, and where entries 
in the table are supported only by partial evidence. In Section H] we write down a symbolic 
learning algorithm, an adaptation of L* for totally ordered alphabets such as M or N and 
illustrate the behavior of a prototype implementation. The algorithm is then extended to 
languages over partially ordered alphabets such as and where in each state, the labels 
of outgoing transition from a monotone partition of the alphabet are represented by finitely 
many points. We conclude by a discussion of past and future work. 

1. Learning Regular Sets 

We briefly survey Angluin’s L* algorithm |Ang87| for learning regular sets from membership 
queries and counter-examples, with slightly modified definitions to accommodate for its 
symbolic extension. Let S be a finite alphabet and let S* be the set of sequences (words) 
over S. Any order relation < over S can be naturally lifted to a lexicographic order over 
S*. With a language L C S* we associate a characteristic function f : T,* ^ —}, where 

f{w) = -|- if the word w £ T,* belongs to L and f{w) = —, otherwise. 

A deterministic finite automaton over S is a tuple A = {'E,Q,6,qo, F), where Q is a 
non-empty finite set of states, qo £ Q is the initial state, <5 : Q x E —>■ Q is the transition 
function, and F C Q is the set of final or accepting states. The transition function 6 can 
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be extended to <5 : Q x S* —)■ Q, where 5[q, e) = q, and 5{q, u ■ a) = S{6{q, u), a) for q G Q, 
a G T, and u G T,*. A word w gT,* is accepted by A if 6{qo, w) G F, otherwise w is rejected. 
The language recognized by A is the set of all accepted words and is denoted by L{A). 

Learning algorithms, represented by the learner, are designed to infer an unknown 
regular language L (the target language). The learner aims to construct a hnite automaton 
that recognizes L by gathering information from the teacher. The teacher knows L and can 
provide information about it. It can answer two types of queries: membership queries, i.e., 
whether a given word belongs to the target language, and equivalenee queries, i.e., whether 
a conjectured automaton suggested by the learner is the right one. If this automaton fails 
to accept L the teacher responds to the equivalence query by a counter-example, a word 
miss-classified by the conjectured automaton. 

In the L* algorithm, the learner starts by asking membership queries. All information 
provided is suitably gathered in a table structure, the observation table. Then, when the 
information is sufficient, the learner constructs a hypothesis automaton and poses an equiv¬ 
alence query to the teacher. If the answer is positive then the algorithm terminates and 
returns the conjectured automaton. Otherwise the learner accommodates the information 
provided by the counter-example into the table, asks additional membership queries until 
it can suggest a new hypothesis and so on, until termination. 

A prefix-closed set S' l±l ii C S* is a balanced Yi-tree if Va G S: 1) For every s G S 
s ■ a G S L) R, and ^ For every rGR, r-a^SUR. Elements of R are called boundary 
elements or leaves. □ 

Definition 1.1 (Observation Table). An observation table is a tuple T = {Ti, S, R, E, f) 
such that S is an alphabet, S'Ui? is a balanced S-tree, E is a subset of S* and / : {SUR)-E —>■ 
{—, -|-} is the classihcation function, a restriction of the characteristic function of the target 
language L. 

The set (S U ii) • E is the sample associated with the table, that is, the set of words whose 
membership is known. The elements of S admit a tree structure isomorphic to a spanning 
tree of the transition graph rooted in the initial state. Each s G S corresponds to a state 
q of the automaton for which s is an access sequence, one of the shortest words that lead 
from the initial state to q. The elements of R should tell us about the back- and cross-edges 
in the automaton and the elements of E are “experiments” that should be sufficient to 
distinguish between states. This works by associating with every s € S' U i? a specialized 
classihcation function /^ : E —>■ dehned as /s(e) = f{s ■ e), which characterizes the 

row of the observation table labeled by s. To build an automaton from a table it should 
satisfy certain conditions. 

Definition 1.2 (Closed, Reduced and Consistent Tables). An observation table T is: 

• Closed if for every r G R, there exists an s € S, such that fr = fs', 

• Reduced if for every s, .s' G S fs ^ fs'] 

• Consistent if for every s, s' G S, fs = fs' implies fs-a = fs'-a,^o. G S. 

Note that a reduced table is trivially consistent and that for a closed and reduced table 
we can dehne a function g : R ^ S mapping every r G R to the unique s G S such that 

fs = fr- Erom such an observation table T = (S, S, R, E, f) one can construct an automaton 

^We use ttl for disjoint union. 



LEARNING REGULAR LANGUAGES OVER LARGE ORDERED ALPHABETS 


5 


At = (s, Q, qo, <5, F) where Q = S, go = e, F = {s e S : fs{e) = +} and 

\ _ / 'S ■ ® when s ■ a € S 

A,o.) I g{s ■ a) when s • a € i? 

The learner attempts to keep the table closed at all times. The table is not closed when 
there is some r G i? such that fr is different from fg for all s € S'. To close the table, the 
learner moves r from R to S and adds the S-successors of r, i.e., all words r ■ a for a € S, 
to R. The extended table is then filled up by asking membership queries until it becomes 
closed. 

Variants of the L* algorithm differ in the way they treat counter-examples, as described 
in more detail in [BR04] . The original algorithm |Ang87| adds all the prefixes of the counter¬ 
example to S and thus possibly creating inconsistency that should be fixed. The version 
proposed in [MP95| for learning w-regular languages adds all the suffixes of the counter¬ 
example to E. The advantage of this approach is that the table always remains consistent 
and reduced with S corresponding exactly to the set of states. A disadvantage is the possible 
introduction of redundant columns that do not contribute to further discrimination between 
states. The symbolic algorithm that we develop in this paper is based on an intermediate 
variant, referred to in [BR04| as the reduced observation algorithm, where some prefixes of 
the counter-example are added to S and some suffixes are added to E. 

Example 1.3. We illustrate the behavior of the L* algorithm while learning a language L 
over E = {1, 2,3,4,5}. We use the tuple {w, -|-) to indicate a counter-example w € L rejected 
by the conjectured automaton, and {w, —) for the opposite case. Initially, the observation 
table is Tq = (E, S, R, E, f) with S = E = {e} and i? = E and we ask membership queries 
for all words in (S D R) ■ E to obtain table Tq, shown in Fig. [TJ The table is not closed 
so we move word 1 to S, add its continuations, 1 • E to i? and ask membership queries to 
obtain table Ti which is now closed. We construct an hypothesis Ai (Fig. [2]) from this table, 
and pose an equivalence query for which the teacher returns counter-example (3 • 1, —). We 
add 3 • 1 and its prefix 3 to set S and add all their continuations to the boundary of the 
table resulting table T 2 of Fig. [TJ This table is not consistent: two elements e and 3 in 
S are equivalent but their successors 1 and 3 • 1 are not. In order to distinguish the two 
strings we add to E the suffix 1 and end up with a closed and consistent table T 3 . The 
new hypothesis for this table is ^ 3 , shown in Fig. [2| Once more the equivalence query will 
return a counter-example, (1 • 3 • 3,—). We again add the counter-example and prefixes 
to the table, ask membership queries to fill in the table and solve the inconsistency that 
appears for 1 and 1 • 3 by adding suffix 3 to the table. The table corresponds now to the 
correct hypothesis A 5 , and the algorithm terminates. D 


2. Symbolic Automata 

In this section we introduce the variant of symbolic automata that we use. Symbolic au¬ 
tomata [HVllJ IVB12j give a more succinct representation for languages over large finite 
alphabets and can also represent languages over infinite alphabets such as N, M, or M”'. The 
size of a standard automaton for a language grows linearly with the size of the alphabet 
and so does the complexity of learning algorithms such as L*. As we shall see, symbolic au¬ 
tomata admit a variant of the L* algorithm whose complexity is independent of the alphabet 


size. 
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Figure 1. Observation tables for Example 11.31 



Figure 2. Hypotheses for Example 11.31 

Let S be a large, possibly infinite, alphabet, to which we will refer from now on as the 
concrete alphabet. We define a symbolic automaton to be an automaton over S where each 
state has a small number of outgoing transitions labeled by symbols that represent subsets 
of S. For every state, these subsets form a (possibly different) partition of S and hence the 
automaton is complete and deterministic. We start with an arbitrary alphabet viewed as 
an unstructured set and present the concept in purely semantic manner before we move to 
ordered sets and inequalities in subsequent sections. 

Let 5] be a finite alphabet, that we call the symbolic alphabet and its elements symbolic 
letters or symbols. Let ?/) : S ^ S map concrete letters into symbolic ones. The S-semantics 
of a symbolic letter a € 51 is defined as [a].^ = {a € S : V'(a) = and the set {[a]^ : a € S} 
forms a partition of S. We will often omit V’ from the notation and use [a] when ■0, which 
is always present, is clear from the context. The S-semantics can be extended to symbolic 
words of the form lu = ai • a 2 • • • € S* as the concatenation of the concrete one-letter 

languages associated with the respective symbolic letters or, recursively speaking, [e] = {e} 
and [ru • a] = [lu] • [a] for w G S*, a G S. 

Definition 2.1 (Symbolic Automaton). A deterministic symbolic automaton is a tuple 
A= where 

• S is the input alphabet. 
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• 5] is a finite alphabet, decomposable into 

9 xf; = {'ipg : q £ Q} is a family of surjective functions : S —>■ 

• <5 is a finite set of states, 

• <5 : Q X S —> (5 and S : Q x 'S ^ Q are the concrete and symbolic transition functions 
respectively, such that 6{q,a) = 5[q,'il)q{a)), 

• qo is the initial state and F is a set of accepting states. 

The transition function is extended to words as in the concrete case and the symbolic 
automaton can be viewed as an acceptor of a concrete language. When at q and reading 
a concrete letter a, the automaton will take the transition 5{q,a) where a is the unique 
element of Sg satisfying a £ [a]. Hence L{A) consists of all concrete words whose run leads 
from go to a state in F. A language L over alphabet S is symbolic recognizable if there 
exists a symbolic automaton A. such that L = L{A). 

Remark: The association of a symbolic language with a symbolic automaton is more subtle 
because we allow different partitions of S and hence different symbolic input alphabets 
at different states. The transition to be taken while being in a state q and reading a 
symbol a 0 is well defined only when [a] C [a'] for some a' £ Sg. Such a model can 
be transformed into an automaton which is complete over a symbolic alphabet which is 
common to all states as follows. Let 

Ti' = X)g, with the S-semantics [(ai,..., a„)] = [ai] n ... n [a„], 
q&Q 

and let S = {6 E S' : [ 6 ] 7 ^ 0}. Then we define A = (S, Q, 5, q^, F) where, by construction, 
for every 6 E S and every q € Q, there is a unique a £ 'Eg such that [ 6 ] C [a] and hence one 
can define the transition function as S{q,b) = S{q,a). This model is more comfortable for 
language-theoretic studies but in the learning context it introduces an unnecessary blow¬ 
up in the alphabet size and the number of queries for every state. For this reason we 
stick in this paper to the Definition 12.11 which is more economical. A similar approach of 
state-local abstraction has been taken in [IHS13] for learning parameterized language. The 
construction of S' is similar to the minterm construction of [DV 14| used to create a common 
alphabet in order to apply the minimization algorithm of Hopcroft to symbolic automata. 
Anyway, in our learning framework symbolic automata are used to read concrete and not 
symbolic words. D 

It is straightforward that for a finite concrete alphabet S the set of languages accepted 
by symbolic automata coincides with the set of recognizable regular languages over S. More¬ 
over, even when the alphabet is infinite, closure under Boolean operations is preserved. 

Proposition 2.2 (Closure under Boolean Operations). Languages accepted by deterministic 
symbolic automata are effectively closed under Boolean operations. 

Proof. Closure under complement is immediate by complementing the set of accepting states. 
For intersection the standard product construction is adapted as follows. Let Li,L 2 be 
languages recognized by the symbolic automata Ai = (S, Si, i/;!, Qi, (5i, 5i, goi, Fi), and 
A 2 = {E,E 2 ,i> 2 ,Q 2 ,d 2 A 2 ,qo 2 ,F 2 ), respectively. Let A= {T,,E,'tlj,Q,6,S,qo, F), where 

• Q = Qi X Q 2 , Qo = {qoi,qo2), F = Fi X F 2 , 

• For every (gi, ^ 2 ) G <3 

- = {{^ 1 , 0 . 2 ) E Si X S 2 I [ai] n [ 02 ] / 0} 

- V'(gi,q 2 )(«) = ('01,91 (a),-02,92(a)), Va E S 
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- ^((91,92), (ai, 02)) = ( 5 i(gi, ai), d 2 {q 2 , 02)), V(ai, 02) € ^{qi,q 2 ) 

It is sufficient to observe that the corresponding implied concrete automata Ai, A 2 and 
A satisfy 6 {{qi, q 2 ), a) = {6i{qi, a), S 2 {q 2 , 0 ,)) and the standard proof that L{A) = L{Ai) n 
L{A 2 ) follows. Closure under union and set difference is then evident. □ 

The above product construction is used to implement equivalence queries where both 
the target language and the current conjecture are represented by symbolic automata. A 
counter-example is found by looking for a shortest path in the product automaton from the 
initial state to a state in Fi x (Q 2 — F 2 ) U (Qi — Fi) x F 2 and selecting a lexicographically 
minimal concrete word along that path. 
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Figure 3. A symbolic automaton A with its symbolic transition function. 

Example 2.3. Figure [3] shows a symbolic automaton equivalent to automaton A 5 of 
Figure El The symbolic alphabets for the states are = {ao,ai}, = { 02 , 03 }, 
Sg 2 = { 04 , 05 , 06 , 07 }, Sgg = {og}, and the S-semantics for the symbols is [oq] = { 1 , 2 }, 
[oi] = {3,4,5}, [ 02 ] = {3}, [ 03 ] = {1,2,4,5}, etc.. The same automaton can accept a 
language over the uncountable alphabet S = [0,100) C M, defining ^ as shown in Figured! 



Figure 4. The concrete semantics of the symbols of automaton A of 
Fig. El when defined over S = [0,100) C M. 


3. Symbolig Observation Tables 

In this section we adapt observation tables to the symbolic setting. They are similar to the 
concrete case with the additional notions of evidences and evidence compatibility. 
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Definition 3.1 (Balanced Symbolic S-Tree). A balanced symbolic Ti-tree is a tuple (H, S, R, 
■0) where 

• S in R is a prefix-closed subset of "E* 

• 5] = l+)sg 5 Eg is a symbolic alphabet 

• 'll) = {V’sjsgs is a family of total surjective functions of the form 0s ; S —> Hg. 

It is required that for every s £ S and a £ Ss, s ■ a £ SU R and for any r £ R and a £ E, 
r ■ a ^ S U R . Elements of R are called boundary elements of the tree. 

We will use observation tables whose rows are symbolic words and hence an entry in the 
table will constitute a statement about the inclusion or exclusion of a large set of concrete 
words in the language. We will not ask membership queries concerning all those concrete 
words, but only for a small representative subset that we call evidence. 

Definition 3.2 (Symbolic Observation Table). A symbolic observation table is a tuple 
T = (S, 5], S, R, 0, E, f, fi) such that 

• S is an alphabet, 

• (S,S',i?, 0) is a balanced symbolic S-tree (with R being its boundary), 

• E is a subset of S*, 

• f : {S U R) ■ E ^ {—,+} is the symbolic classification function 

• : [S U R) ■ E ^ 2^ — {0} is an evidence function satisfying ^{w) C [m]. The image of 
the evidence function is prefix-closed: w ■ a £ fj,{w ■ a) ^ w £ fr{w). 

As for the concrete case we use fg ■ E ^ {—,+} to denote the partial evaluation of f to 
some symbolic word s £ SU R, such that, /s(e) = /(s • e). Note that the set E consists of 
concrete words but this poses no problem because elements of E are used only to distinguish 
between states and do not participate in the derivation of the symbolic automaton from 
the table. Concatenation of a symbolic word and a concrete one follows concatenation of 
symbolic words as defined above where each concrete letter a is considered as a symbolic 
letter a with [a] = {a} and /u(a) = a. The notions of closed, consistent and reduced table 
are similar to the concrete case. 

The set Mt = {SUR) ■ E is called the symbolic sample associated with T. We require 
that for each word w £ Mt there is at least one concrete w £ fi{w) whose membership in 
L, denoted by f{w), is known. The set of such words is called the concrete sample and is 
defined as Mt = {s ■ e : s £ fi{s),s £ S U R,e £ E}. A table where all evidences of the 
same symbolic word admit the same classification is called evidence-compatible. 

Definition 3.3 (Table Conditions). A table T = {E,E, S,R,'4),E, f, p) is 

• Closed if Vr £ R, 3s = g{r) £ S, fr = fs, 

• Reduced if Vs, s' £ S, fg ^ fg>, 

• Consistent if Vs, s' £ S, fg = fg> implies fg.a = fs'-a^o. £ Eg. 

• Evidence compatible if Vm £ Mt, \/wi,W 2 £ p{w),f{wi) = f{w 2 ). 

When a table T is evidence compatible the symbolic classification function f can be defined 
for every s £ (S U R) and e £ E as f{s ■ e) = /(s ■ e), s £ p{s). 

Theorem 3.4 (Automaton from Table). From a closed, reduced and evidence compatible 
table one can construct a deterministic symbolic automaton compatible with the concrete 
sample. 
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Proof. The proof is similar to the concrete case. Let T = (S, S, S, R, if, E, f, fi) be such a 
table, which is reduced and closed and thus a function g : R ^ S such that g{r) = s iff fr = 
fs is well defined. The automaton derived from the table is then A-t = (S, S, fj, Q, S, qq, F) 
where: 

• Q = S, qo = e 


• F = {seS\ fsie) = +} 

\ ( s ■ a when s ■ a G S 

• o : Q X 2j ^ Q IS dehned as o s,a = < , „ 

I when s ■ a G R 

By construction and like the L* algorithm, At classifies correctly the symbolic sample and, 
due to evidence compatibility, this holds also for the concrete sample. □ 


4. Learning Languages over Ordered Alphabets 

In this section we present a symbolic learning algorithm starting with an intuitive verbal 
description. The algorithmic scheme is similar to the concrete L* algorithm but differs in the 
treatment of counter-examples and the new concept of evidence compatibility. Whenever 
the table is not closed. S' U is extended until closure. Then a conjectured automaton 
At is constructed and an equivalence query is posed. If the answer is positive we are done. 
Otherwise, the teacher provides a counter-example leading to the extension of SUi? and/or 
E. Whenever such an extension occurs, additional membership queries are posed to hll the 
table. The table is always kept evidence compatible and reduced except temporarily during 
the processing of counter-examples. 

From now on we assume S to be a totally ordered alphabet with a minimal element 
qq and restrict ourselves to symbolic automata where the concrete semantics for every 
symbolic letter is an interval. In the case of a dense order like in M, we assume the intervals 
to be left-closed and right-open. The order on the alphabet can be extended naturally to a 
lexicographic order on S*. Our algorithm also assumes that the teacher provides a counter¬ 
example of minimal length which is minimal with respect to the lexicographic order. This 
strong assumption improves the performance of the algorithm and its relaxation is discussed 
in Section [71 

The rows of the observation table consist of symbolic words because we want to group 
together all concrete letters and words that are assumed to induce the same behavior in 
the automaton. New symbolic letters are introduced in two occasions: when a new state is 
discovered or when a partition is modified due to a counter-example. In both cases we set 
the concrete semantics [a] to the largest possible subset of S, given the current evidence 
(in the hrst case it will be S). As an evidence we always select the smallest possible a G [a] 
(oo when [a] = S). The choice of the right evidences is a key point for the performance of 
the algorithm as we want to keep the concrete sample as small as possible and avoid posing 
unnecessary queries. For inhnite concrete alphabets this choice of evidence guarantees 
termination. 

The initial symbolic table is T = (B,S, S, R,'ijj, E, f, g,), where 51 = {ao}, [cio] = 51, 
S = {e}, R = {ao}, E = {e}, and g{ao) = {oo}. The table is filled by membership queries 
concerning e and uq. Whenever T is not closed, there is some r G R such that fr 7 ^ fs 
for every s G S. To close the table we move r from R to S, recognizing it as a new state, 
and checking the behavior of its continuation. To this end we add to R the word r' = r ■ a, 
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Algorithm 1 The symbolic algorithm 
1: procedure Symbolic 
2 : learned = false 

3: Initialize the table T = (S, S, S, R, if, E, f, /i) 

4: 5] = {a}; V’e(«) = a,\/a T; 

5: 5 = {e}; i? = {a}; E = {e} 

6: /i(a) = {ao} 

7: Ask MQ on e and oq to fill / 

8: if T is not closed then 

9: Close 

10: end if 

11: repeat 

12 : if EQ{At) then 

13: learned = true 

14: else 

15: M = MU {ic} 

16: COUNTER-EX(ui) 

17: end if 

18: until learned 

19: end procednre 


> At is correct 
0 A counter-example w is provided 
t> Process counter-example 


Procedure 2 Close the table 

1 

procedure CLOSE 


2 

while 3r G i? such that Vs ^ S, fr A fs do 


3 

S' = SU {r} 

i> r becomes a new state 

4 

51' = S U {Unew} 


5 

A' = A ^ {Ar} with Ario) = Unew, Va G S 


6 

R' = (R- {r}) U {r ■ a^ew} 


7 

fi{r ■ anew) = fj-ir) ■ oq 


8 

Ask MQ for all words in {/i(r • anew) • e : e G E} 


9 

T = {E,^',S',R',A',E,f',A) 


10 

end while 


11 

end procedure 



where a is a new symbolic letter with [a] = S. We extend the evidence function by letting 
/i(r') = fi{r) ■ ao, assuming that all elements of S behave as ao from r. Once T is closed 
we construct a hypothesis automaton as described in the proof of Theorem 18.41 

When a counter-example w is presented, it is of course not part of the concrete sam¬ 
ple. A miss-classified word in the conjectured automaton means that somewhere a wrong 
transition is taken. Hence w admits a factorization w = u ■ b ■ v where u G S* and 6 G S 
is where the first wrong transition is taken. Obviously we do not know u and b in advance 
but know that this happens in the following two cases. Either b leads to an undiscovered 
state in the automaton of the target language, or letter b does not belong to the interval it 
was assumed to belong in the conjectured automaton. The latter case happens only when 
b does not belong to the evidence function. Since counter-example w is minimal, it admits 
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Procedure 3 Process counter-example 


10 : 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23: 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 


procedure CouNTER-Ex(t(;) 

Find a factorization w = u ■ b ■ v, b £ Ti, u, v & T,* such that 
3u £ Mt, u £ fi{u) and Vu' £ u ■ b ^ 

ii u £ S then 

Find a £ such that b £ [a] 

= S U {Unew} 

R' = RD {u ■ a„ew} 

IJ,{u ■ anew) = fj-(u) ■ b 

Ask MQ for all words in {fi{u • Unew) • e : e £E} 

{ V’it(a) if a ^ [a] 

a„ew if a G [a] and a >b 

a otherwise 

T = (S,S', S,R',i;',E,f,fi') 


0 u is already a state 
i> refine [al 


else 


S' = SU {u} 

if 6 = oo then 

S' = H U {Unew} 

^ u {i’u}, with V’it(a) = a 

R' = (R- {u}) U {u ■ anew} 

E' = EU {suffixes of 6 • u} 

n{u ■ anew) = • ao 

Ask MQ for all words in {fi{u ■ 
else 

s = S U {a„ew) ®new} 

V’' = u {'ipu}, with V’it(a) = 


> u is in the boundary 
> and becomes a state 


„Va G S 


R' = {R- {u}) U {u ■ anew 
E' = EU {suffixes of 6 • u} 

n{u ■ anew) = m('“) • «o; ■ a 

Ask MQ for all words in {{n{u 

end if 

T = iE,^',S',R',^P',E',f',^J,') 

end if 

if T is not closed then 
CLOSE 

end if 

end procednre 




“n 

an 


u ■ a' 


e : e £ E'} 


if a > 6 
otherwise 


new) = M('^) ■ b 
^new) U fJj(^U * Cl. 


/ 

new 


))-e:e£E'} 


a factorization w = u ■ b ■ v, where u is the largest prefix of w such that u £ fi{u) for some 
u £ S L) R but s - b ^ fJ'iu') for any word u' in the symbolic sample. We consider two cases, 
u £ S and u £ R. 

In the first case, when u £ S, u is already a state in the hypothesis but b indicates that 
the partition boundariues are not correctly defined and need refinement. That is, u ■ b was 
wrongly considered to be part of [u ■ a] for some a £ and thus b was wrongly considered 
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to be part of [a]. Due to minimality, all letters in [a] less than letter b behave like /i(a). We 
assume that all remaining letters in [a] behave like b and map them to a new symbol anew 
that we add to We then update 'ijju such that = “new for all a G [a],a > b, and 

= 'fpu{cL), otherwise. The evidence function is updated by letting /x(rr-anew) = fJ-iu) - b 
and u ■ a„ew is added to R. 

In the second case, the symbolic word u is part of the boundary. From the counterex¬ 
ample we deduce that u is not equivalent to any of the existing states in the hypothesis and 
should form a new state. Specifically, we find the prefix s that was considered to be equiv¬ 
alent to u, that is g{u) = s £ S. Since the table is reduced fu ^ /s' for any other s' G S. 
Because w is the shortest counter-example, the classification of s • 6 ■ u in the automaton is 
correct (otherwise s - b-v, for some s G [s] would constitute a shorter counter-example) and 
different from that oi u ■ b ■ v. We conclude that u is a new state, which is added to S. To 
distinguish between u and s we add to E the word b ■ v, possibly with some of its suffixes 
(see |BR04] for a more detailed discussion of counter-example processing). 

As ri is a new state we need to add its continuations to R. We distinguish two subcases 
depending on b. If 6 = oq, the smallest element of B, then a new symbolic letter anew is 
added to B, with [anew] = B and fj,{u ■ anew) = ^(w) • qq, and the symbolic word u ■ anew is 
added to R. If 6 / oq then two new symbolic letters, anew and ang^, are added to B with 
[“new] = {a : a < b}, [a'n^J = {a: a>b}, fi{u ■ anew) = /^(w) ■ oq and p(ri • a^^^^) = ■ b. 

The words u ■ a^ew and u ■ are added to R. 

A detailed description of the algorithm is given in Algorithm [T] and its major proce¬ 
dures, table closing and counter-example treatment are described in Procedures [2] and [3| 
respectively. A statement of the form B' = B U {a} indicates the introduction of a new 
symbolic letter a ^ B. We use MQ and EQ as shorthands for membership and equivalence 
queries, respectively. In the following we illustrate the symbolic algorithm as applied to a 
language over an inhnite alphabet. 

Example 4.1. Let B = [0,100) C M with the usual order and let L C B* be a target 
language. Fig. [5] shows the evolution of the symbolic observation tables and Fig. [ 6 ] depicts 
the corresponding automata and the concrete semantics of the symbolic alphabets. 

We initialize the table with S = {e}, R = {ao}, /r(ao) = {0} and E = {e} and ask 
membership queries for e (rejected) and 0 (accepted). The obtained table, Tq is not closed 
so we move ao to S, introduce B^q = {ai}, where ai is a new symbol, and add ag • ai 
to R with fj.{aQ • ai) = 0-0. Asking membership queries we obtain the closed table Ti 
and its automaton A-i. We pose an equivalence query and obtain (50,—) as a (minimal) 
counter-example which implies that all words smaller than 50 are correctly classified. We 
add a new symbol a 2 to Bg and redehne the concrete semantics to [ag] = {a < 50} and 
[a 2 ] = {a > 50}. As evidence we select the smallest possible letter, fj.{a 2 ) = 50, ask 
membership queries to obtain the closed table T 2 and automaton ^ 2 - 

For this hypothesis we get a counter-example (0 • 30, —) whose prefix 0 is already in the 
sample, hence the misclassification occurs in the second transition. We refine the alphabet 
partition for state ag by introducing a new symbol ag and letting [ai] = {a < 30} and 
[aa] = {a > 30}. Table T 3 is closed but automaton A-g is still incorrect and a counter¬ 
example (50 • 0, —) is provided. The prefix 50 belongs to the evidence of a 2 and is moved 
from the boundary to become a new state and its successor a 2 • a 4 , for a new symbol a 4 , 
is added to R. To distinguish a 2 from e, the suffix 0 of the counter-example is added to E 
resulting in T 4 which is not closed. The newly discovered state ag ■ ai is added to S, the 
filled table T 5 is closed and the conjectured automaton A 5 has two additional states. 
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To 



e 

e 

- 

do 
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Ti 



e 

e 

do 

+ 

do ■ di 
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T 2 



e 

e 

- 

do 
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do - di 

+ 

d2 

- 


Ts 



e 

e 

- 

ao 

+ 

do - di 

+ 

d2 

- 

do ■ do 
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T 4 



e 0 

e 

- + 

do 

+ + 

d2 

- 

ao - di 

+ - 

ao - as 

- 

d2 - d4 

- + 



e 0 

e 

- -b 

ao 

+ + 

d2 

- 

do ■ di 

+ - 

do ■ do 

- 

d2 - d4 
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ao - ai - as 

- 

d2 ■ do 
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e 

- + 

ao 

+ + 

d2 

- 

ao - di 

+ - 

do ■ as 

- 

d2 - d4 

- + 

do-di- as 

- 

d2 ■ do 

+ - 



e 0 

e 

- + 

do 

+ + 

d2 

- 

do • di 

+ - 

do ■ as 

- 

d2 - d4 

- + 

ao - ai - as 

- 


Ts 



e 0 

e 

- + 

do 

+ + 

d2 

- 

do ■ di 

+ - 

do ■ as 

- 

d2 - d4 

- + 

do-di- as 

- 

d2 ■ dQ 

+ - 

d2 ■ dj 

- 

d2 ■ dg 

+ + 


Figure 5. Observation tables for Example 14.11 

Subsequent equivalence queries result counter-examples (50-20, -f), (50-80, —) and (50- 
50 - 0, -b) which are used to refine the alphabet partition at state a 2 and modify its outgoing 
transitions progressively as seen in automata As, Ay and As, respectively. Automaton 
accepts the target language and the algorithm terminates. Q 

Note that for the language in Example 11.31 the symbolic algorithm needs around 30 queries 
instead of the 80 queries required by L*. If we choose to learn a language as the one described 
in Example 14.11 restricting the concrete alphabet to the finite alphabet S = {1,..., 100}, 
then L* requires around 1000 queries compared to 17 queries required by our symbolic 
algorithm. As we shall see in Section [6l the complexity of the symbolic algorithm does not 
depend on the size of the concrete alphabet, only on the number of transitions. 

5. Learning Languages over Partially-ordered Alphabets 

In this section we sketch the extension of the results of this paper to partially-ordered 
alphabets of the form S = where X is a totally-ordered set such as an interval [0, k) C 
M. Letters of E are d-tuples of the form x = (xi,... ,Xd) and the minimal element is 
0 = (0,..., 0). The usual partial order on this set is defined as x < y if and only if Xi < jji 
for alH = 1,..., d. When x < y and Xi / y* for some i the inequality is strict, denoted by 
X < y, and we say then that x dominates y. Two elements are incomparable, denoted by 
x||y, if Xi < Pi and Xj > pj for some i and j. 
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Figure 6 . Hypotheses and S-semantics for Example 14.11 


For partially-ordered sets, a natural extension of the partition of an ordered set into 
intervals is a monotone partition, where for each partition block P there are no three points 
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XI 


yi 



y2 

Xl 



ys 

X2 

X3 

Xl 


X4 y4 


y||x 

B+(x) 

y>x 

X 

y < X 

y||* 


(a) Backward and forward (b) Union of cones (c) An alphabet partition 

cone for x 


Figure 7 




Figure 8 . Modifying the alphabet partition for state u after receiving 
u ■ b ■ V as counter-example. Letters above b are moved from [a] to \a']. 

such that X < y < z, X, z G P, and y 0 P. We define in the following such partitions 
represented by a finite set of points. 

A forward cone P^(x) C S is the set of all points dominated by a point x € S (see 
Fig. [7a)l . Let P = {xi,... ,x;} be a set of points, then P+(P) = P’''(xi) U ... U as 

shown in Fig. [70 From a family of sets of points P = {Pq, ..., Pm-i}, such that Fq = {0} 
satisfying for every i: 1) Vy € Pj, 3x G Pj_i such that x < y, and 2) Vy G Pj, Vx G 
Pj-i, y X, we can define a monotone partition of the form V = {Pi, .. . ,Pm_i}, where 
P, = P+(Pi_i) - P+(P), see Fig. m 

A subset P of S, as defined above, may have several mutually-incomparable minimal 
elements, none of which being dominated by any other element of P. One can thus ap¬ 
ply the symbolic learning algorithm but without the presence of unique minimal evidence 
and minimal counter-example. For this reason a symbolic word may have more than one 
evidence. Evidence compatibility is preserved though due to the nature of the partition. 

The teacher is assumed to return a counter-example chosen from a set of incomparable 
minimal counter-examples. Like in the algorithm for totally ordered alphabet, every counter¬ 
example either discovers a new state or refines a partition. The learning algorithm for 
partially-ordered alphabets is similar to Algorithm [T] and can be applied with only a minor 
modification in the treatment of the counterexamples and specifically in the refinement 
procedure. Lines 6-8 of Procedure [3| should be ignored in the case where there exists a 
symbolic letter a', as illustrated in Fig. [8al such that f{u - b ■ e) = f{u ■ a' ■ e) for all e G P. 
In such a case, function if is updated as in line 9 by replacing a^ew by a' and b should 
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be added to In Fig. I8b[ one can see the partition after refinement, where all letters 

above b have been moved from [a] to \a']. 


To 
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- 
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e 
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ao 
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02 
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Oq ■ 03 

- 
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- -t 
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02 • Og 

-L - 

02 • 07 

- 
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02 
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02 • 07 
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Figure 9. Observation tables for Example 15.11 

Example 5.1. Let us illustrate the learning process for a target language L defined over 
S = [0,100]^. All tables, hypotheses automata and alphabet partitions for this example are 
shown in Figures [9l [TOl andfTTl respectively. 

The learner starts asking MQs for the empty word. A symbolic letter ag is chosen to 
represent its continuations with the minimal element of S as evidence, i.e., /i(ao) = (q) • The 
symbolic word ag is moved to S for the table Tq to be closed. The symbolic letter ai is added 
to the alphabet of state ag, and the learner asks a MQ for ([J) (q) , the evidence of the symbolic 
word agai. The first hypothesis automaton is .Ag with S-semantics [ao] = [ai] = S. The 
counter-example ((^q), —) refines the partition for the initial state. The symbolic alphabet 
is extended to = {ag,a 2 } with [a 2 ] = {x >= (gg)}, [ao] = E — [a 2 ], and ^ 7 ( 02 ) = (gg). 
The new observation table and hypothesis are Ti and Ai. Two more counter-examples will 
come to refine the partition for the initial state, {{^g), —) and ((^^g), —), that will modify the 
partition for the initial state, moving all letters greater than ) and (^g) to the S-semantics 
of a 2 as can be seen in ■02 and 'ipg respectively. 

After the hypothesis ^ 43 , the counter-example ((g) (g°g), —) adds a new symbol ag and 
a new transition in the hypothesis automaton. The counter-examples that follow, namely, 
((o)(o°)’“)’ ((o)(i 5 )’“)’ ( 0 ( 30 )’“) 'define the S-semantics for symbols in as 

shown in 'ipi-j. 

Then counter-example (( 5 g)(g) 5 +) is presented. As we can see, the prefix (^q) exist 
already in /x(a 2 ) and a 2 G R which means a 2 becomes a state, and to distinguish it from 
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Figure 10. Hypothesis automata for Example 15.11 

the state represented by the empty word the learner adds to E the suffix of the counter¬ 
example (q) . The resulting table Tg is not closed and aoai is moved to S. The new table Tg 
is closed and evidence compatible. The hypothesis has now four states and the symbolic 
alphabet and S-semantics for each state can be seen in ipg. The counter-examples that follow 
will refine the partition at state a 2 . The new transitions discovered and all refinements are 
shown in ^lo-ig and tpio — ^pls■ The language was learned using 20 membership queries 
and 17 counter-examples. D 


6. On Complexity 

The complexity of the symbolic algorithm is influenced not by the size of the alphabet but 
by the resolution (partition size) with which we observe it. Let L C S be the target language 
and let ^4. be the minimal symbolic automaton recognizing this language with state set Q 
of size n and a symbolic alphabet X) = 'Sg such that |Sq| < m for every q. 

Each counter-example improves the hypothesis in one out of two ways. Either a new 
state is discovered or a partition gets refined. Hence, at most n — 1 equivalence queries 
of the first type can be asked and n{m — 1) of the second, resulting in 0 {mn) equivalence 
queries. 

Concerning the size of the table, the set of prefixes S is monotonically increasing and 
reaches the size of exactly n elements. Since the table, by construction, is always kept 
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i’5 


i’6 


Figure 11. Alphabet partition for Example 15.11 ioart 1) 

reduced, the elements in S represent exactly the states of the automaton. The size of the 
boundary is always smaller than the total number of transitions in the automaton, that is 
mn — n + 1. The number of suffixes in E, playing a distinguishing role for the states of 
the automaton, range between log 2 n and n. Hence, the size of the table ranges between 
(n + m) log 2 n and n(mn + 1). 

For a totally ordered alphabet the size of the concrete sample coincides with the size of 
the symbolic sample associated with the table and hence the number of membership queries 
asked is O(mn^). For a partially ordered alphabet with each Fi defined by at most I points, 
some additional queries are asked. For every row in S, at most n{m — 1){1 — 1) additional 
words are added to the concrete sample, hence more membership queries might need to be 
asked. Furthermore, at most I — 1 more counter-examples are given to refine a partition. 
To conclude, the number of queries in total asked to learn language L is 0{mn‘^) if I < n 
and 0{lmn) otherwise. 
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7. Conclusion 

We have defined a generic algorithmic scheme for automaton learning, targeting languages 
over large alphabets that can be recognized by hnite symbolic automata having a modest 
number of states and transitions. Some ideas similar to ours have been proposed for the 
particular case of parametric languages [R.TRnfij and recently in a more general setting 
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Figure 11. Alphabet partition for Example 15.11 (part 2) 

[HSMlll IIHS131IBB13] including partial evidential support and alphabet rehnement during 
the learning process. 

The genericity of our algorithm is due to a semantic approach (alphabet partitions) but 
of course, each and every domain will have its own semantic and syntactic specialization in 
terms of the size and shape of the alphabet partitions. In this work we have implemented 
an instantiation of this scheme for alphabets such as (N, <) and (M, <). When dealing with 
numbers, the partition into a finite number of intervals (and monotone sets in higher dimen¬ 
sions) is very natural and used in many application domains ranging from quantization of 
sensor readings to income tax regulations. It will be interesting to compare the expressive 
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power and succinctness of symbolic automata with other approaches for representing nu¬ 
merical time series and to compare our algorithm with other inductive inference techniques 
for sequences of numbers. 

As a first excursion into the domain, we have made quite strong assumptions on the 
nature of the equivalence oracle, which, already for small alphabets, is a bit too strong and 
pedagogical to be realistic. We assumed that it provides the shortest counter-example and 
also that it chooses always the minimal available concrete symbol. We can relax the latter (or 
both) and even omit this oracle altogether and replace it by random sampling, as already 
proposed in |Ang87| for concrete learning. Over large alphabets, it might be even more 
appropriate to employ probabilistic convergence criteria a-la PAC learning [Val84| and be 
content with a correct classification of a large fraction of the words, thus tolerating imprecise 
tracing of boundaries in the alphabet partitions. This topic is subject to ongoing work. 
Another challenging research direction is the adaptation of our framework to languages 
over Boolean vectors. 
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